Selectively Acquiring Customer Information: A New Data Acquisition Problem and an Active Learning-Based Solution
نویسندگان
چکیده
This paper presents a new information acquisition problem motivated by business applications where customer data has to be acquired with a specific modeling objective in mind. In the last two decades there has been substantial work in two different fields optimal experimental design and machine learning – that has addressed the issue of acquiring data in a selective manner with a specific objective in mind. We show in this paper that the problem presented here is different from the classic model-based data acquisition problems considered thus far in the literature in both fields. Building on the work in optimal experimental design and in machine learning we develop a new active learning technique for the information acquisition problem presented in this paper. We demonstrate that the proposed method perform well based on results from applying this method across twenty Web usage and machine learning datasets.
منابع مشابه
Selective Data Acquisition for Machine Learning
In many applications, one must invest effort or money to acquire the data and other information required for machine learning and data mining. Careful selection of the information to acquire can substantially improve generalization performance per unit cost. The costly information scenario that has received the most research attention (see Chapter X) has come to be called ”active learning,” and...
متن کاملOn Active Learning for Data Acquisition
Many applications are characterized by having naturally incomplete data on customers – where data on only some fixed set of local variables is gathered. However, having a more complete picture can help build better models. The naïve solution to this problem – acquiring complete data for all customers – is often impractical due to the costs of doing so. A possible alternative is to acquire compl...
متن کاملActive Feature Acquisition for Classifier Induction
Many induction problems, such as on-line customer profiling, include missing data that can be acquired at a cost, such as incomplete customer information that can be filled in by an intermediary. For building accurate predictive models, acquiring complete information for all instances is often prohibitively expensive or unnecessary. Randomly selecting instances for feature acquisition allows a ...
متن کاملSolution of Backup Multifacility Location Problem by Considering the Ideal Radius for each Customer
In this paper we introduce a new facility location model, called backup multifacility location problem by considering the ideal radius for each customer. In this problem the location of clients are given in the plane. A radius is assigned to each client. We should find the location of new facilities, which some of them may fail with a given probability, such that the sum of weighted distances f...
متن کاملActive Learning: An Approach for Reducing Theory-Practice Gap in Clinical Education
Introduction: The gap between theory and practice in clinical fields, including nursing, is one of the main problems that many solutions have been suggested to eliminate it. In this article, we have tried to investigate its solution through active learning. Methods: In this review article, searching articles published during 2000-2012 was done through library references, scientific databases. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Management Science
دوره 52 شماره
صفحات -
تاریخ انتشار 2006